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Abstract. We initiate tlie study of the Bipartite Contraction prob- 
lem from the perspective of parameterized complexity. In this problem 
we are given a graph G and an integer k, and the task is to determine 
whether we can obtain a bipartite graph from G by a sequence of at most 
k edge contractions. Our main result is an f{k) nP^^^ time algorithm for 
Bipartite Contraction. Despite a strong resemblance between Bipar- 
tite Contraction and the classical Odd Cycle Transversal (OCT) 
problem, the methods developed to tackle OCT do not seem to be di- 
rectly applicable to Bipartite Contraction. Our algorithm is based 
on a novel combination of the irrelevant vertex technique, introduced by 
Robertson and Seymour, and the concept of important separators. Both 
techniques have previously been used as key components of algorithms 
for fundamental problems in parameterized complexity. However, to the 
best of our knowledge, this is the first time the two techniques are applied 
in unison. 



1 Introduction 

Odd Cycle Transversal (OCT) is a central problem in parameterized com- 
plexity. The establishment of its fixed parameter tractability by Reed, Smith, 
and Vetta [20] in 2004, settling a long-standing open question [6], supplied the 
field with the powerful new technique of iterative compression [18]. OCT and 
the closely related Edge Bipartization problem take a graph G and an inte- 
ger k as input, and ask whether a bipartite graph can be obtained by deleting 
at most k vertices, respectively k edges, from G. These two problems can be 
viewed as two ways of measuring how close G is to being bipartite. Over the 
last few years a considerable amount of research has been devoted to studying 
different measures of how close a graph is to being bipartite [7,8,13,12], and 
how similarity to a bipartite graph can be exploited [4]. 

A natural similarity measure is defined by the Bipartite Contraction 
problem: Given a graph G and an integer k. can wc obtain a bipartite graph from 
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G by a sequence of at most k edge contractions in G? Tlie number of possible edge 
contractions in G is always less than the number of vertices of G, since every edge 
contraction reduces the number of vertices by exactly one. In practical instances 
of bipartization problems the similarity parameter k tends to be small [21], 
which makes these problems especially well-suited for parameterized algorithms. 
A graph problem with input G and k is fixed parameter tractable (FPT) if there 
is an algorithm with running time f{k) rfi^^\ where the function / depends only 
on k and not on the size of G. Considering the significant amount of interest the 
problems OCT and Edge Bipartization have received, we find it surprising 
that Bipartite Contraction has not yet been studied. 

In this paper we show that Bipartite Contraction is fixed parameter 
tractable when parameterized by the number k of edges to be contracted. The 
key ingredients of our algorithm fundamentally differ from the ones used in the 
above-mentioned algorithms for OCT and Edge Bipartization. In the algo- 
rithm for OCT by Reed, Smith, and Vetta [20] , iterative compression is combined 
with maximum flow arguments. The recent nearly linear time algorithm for the 
two problems, due to Kawarabayashi and Reed [12], uses the notion of odd mi- 
nors, together with deep structural results of Robertson and Seymour [22] about 
graphs of large trcewidth without large clique minors. Interestingly, Bipartite 
Contraction does not seem to be amenable to these approaches. 

Although our algorithm is based on iterative compression, it seems difficult to 
adapt the compression step from [20] for OCT to work for Bipartite Contrac- 
tion. Instead, we perform the compression step using a variant of the irrelevant 
vertex technique, introduced by Robertson and Seymour [22] (see also [23]). In 
particular, if the treewidth of the input graph is large, then we identify an irrel- 
evant edge that can be deleted from the graph without affecting the outcome. 
The irrelevant vertex technique has played a key role in the solutions of several 
problems (see e.g., [11, 13, 14]). 

Our algorithm crucially deviates from previous work in the manner in which it 
finds the irrelevant edge. While previous work has relied on large minor models 
as obstructions to small treewidth, ours uses the fact that any graph of high 
treewidth contains a large p- connected set X [5]. A vertex set X is p-connected if, 
for any two subsets Xi and X2 of X with \Xi \ = \X2\ < p, there are \Xi\ vertex- 
disjoint paths with one endpoint in Xi and the other in X2. Using p-connected 
sets in order to find irrelevant edges has several advantages. First, our algorithm 
avoids the huge parameter-dependence which seems to be an inadvertent side 
effect of applying the Robertson and Seymour's graph minors machinery. Second, 
our arguments are nearly self-contained, and rely only on results whose proofs 
are simple enough to be taught in a graduate class. Thus, even though our 
algorithm is not practical by any means, it is much closer to practicality than 
other algorithms based on the irrelevant vertex technique. It is an intriguing 
question whether some of the algorithms that currently use Robertson- Seymour 
machinery to find an irrelevant vertex can be modified in such a way, that they 
find an irrelevant vertex using p-connected sets instead. 
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Using p-connected sets in order to find an irrelevant vertex or edge is non- 
trivial, because p-connectivity is a more "implicit" notion than that of a large 
minor model. We overcome this difficulty by using important sets. Important sets 
and the closely related notion of important separators were introduced in [16] 
to prove the fixed-parameter tractability of multiway cut problems. The basic 
idea is that in many problems where terminals need to be separated in some 
way, it is sufficient to consider separators that are "as far as possible" from one 
of the terminals. Important separators turned out to be a crucial component, 
in some cases implicitly, in the solutions of cardinal problems in parameterized 
complexity [2, 3, 19, 17]. To the best of our knowledge, this is the first time the 
irrelevant vertex technique and important sets (or separators) arc used together. 
We believe that this combination will turn out to be a useful and powerful tool. 

2 Definitions and notation 

All graphs considered in this paper are finite, undirected, and simple, i.e., do not 
contain multiple edges or loops. Given a graph G, we denote its vertex set by 
V{G) and its edge set by E{G). We also use the ordered pair {V{G),E{G)) to 
represent G. We let n = \V{G)\ and m = \E{G)\. For two graphs Gi = 
and G2 = (V2,i?2), the disjoint union of Gi and G2 is the graph Gi U G2 = 
{Vi U V2,Ei U E2). The deletion of an edge e e E{G) yields the graph G ~ e = 
{V{G),E{G) \ e). For a set X C V{G), we write G[X] to denote the subgraph 
of G induced by X. A graph is connected if there is a path between every pair 
of its vertices. The connected components of a graph are its maximal connected 
subgraphs. For any set X C V{G), we write 5g{X) to denote the set of edges in 
G that have exactly one endpoint in X. We define dc^X) = \5{X)\. 

The contraction of edge xy in G deletes vertices a; and y from G, and replaces 
them by a new vertex, which is made adjacent to precisely those vertices that 
were adjacent to at least one of the vertices x and y. The resulting graph is 
denoted G/xy. Every edge contraction reduces the number of vertices in the 
graph by exactly one. We point out that several edges might disappear as the 
result of a single edge contraction. For a set 5 C E{G), we write G/S to denote 
the graph obtained from G by repeatedly contracting an edge from S until no 
such edge remains. It follows from the definition of an edge contraction that in 
order to obtain the graph G/S from S, it is necessary and sufficient to contract 
all the edges of some spanning forest of the graph {V{G),S). 

Let H he a, graph with V{H) = {hi, /12, • • ■ , he}- A graph G is H-contractible 
if H can be obtained from G by contracting edges. Saying that G is iJ-contractible 
is equivalent to saying that G has a so-called H-witness structure W, which is 
a partition of V{G) into witness sets W {hi) ,W {h2) ■ ■ ,W {hi) . The witness 
sets have the property that each of them induces a connected subgraph of G, 
and for every two hi,hj G y{H), there is an edge in G between a vertex of 
W{hi) and a vertex of W{hj) if and only if hi and hj are adjacent in H. 
Let G' = G[W{hi)] U •■• U G[W{ht)] be the graph obtained from G by re- 
moving all the edges of G, apart from the ones that have both endpoints in 
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the same witness set. In order to contract G to i7, it is necessary and suf- 
ficient to contract all the edges of some spanning forest F of G". Note that 
\E{F)\ = Y.Um{h.)\ - 1) = \V{G)\ - \V{H)\. 

A 2-coloring of a graph G is a function (j> ■ V{G) {1, 2}. We point out that 
a 2-coloring of G is merely an assignment of colors 1 and 2 to the vertices of G, 
and should therefore not be confused with a proper 2-coloring of G, which is a 
2-coloring with the additional property that no two adjacent vertices receive the 
same color. An edge uv is said to be good (with respect to 0) if 4'{u) ^ (t>{v), and 
uv is called bad (with respect to (/)) otherwise. A good component of is the vertex 
set of a connected component of the graph {V{G), E'), where E' C E is the set of 
all edges that are good with respect to (f>. Any 2-coloring of G defines a partition 
of V{G) into two sets and V^, which are the sets of vertices of G colored 1 
and 2 by (j), respectively. A set X C V{G) is a monochromatic component of (j) 
if G[X] is a connected component of G[V^^] or a connected component of G[V^]. 
We write A^^ to denote the set of all monochromatic components of <j>. The 
cost of a 2-coloring (p is defined as J2xeM^(\-^\ ~ Note that the cost of a 
2-coloring of G is if and only if is a proper 2-coloring of G. 

Let G be a graph. A tree decomposition of G is a pair (T, X ~ {Xt}tev{T))j 
where T is a tree and A:" is a collection of subsets of V{G), satisfying the following 
three properties: (1) Uuev{T)Xu = V; (2) yuv e E{G),3t e V{T) : {u,v} C 
Xt; and (3) Wv G V{G), T[{t \ v £ Xt}] is connected. The width of a tree 
decomposition is max(gy(y) \Xt \ — 1 and the treewidth of G, denoted tw{G), is 
the minimum width over all tree decompositions of G. 

The syntax of Monadic Second Order Logic (MSO) of graphs includes the 
logical connectives V, A, variables for vertices, edges, sets of vertices and sets 
of edges, the quantifiers V, 3 that can be applied to these variables, and the 
following five binary relations: 

1. u G U, where m is a vertex variable and U is a, vertex set variable; 

2. d G D, where d is an edge variable and D is an edge set variable; 

3. inc(d, m), where d is an edge variable, u is a vertex variable, and the inter- 
pretation is that the edge d is incident to the vertex u; 

4. adj(u,?;), where u and v are vertex variables and the interpretation is that 
u and V are adjacent; 

5. equality of variables representing vertices, edges, sets of vertices and sets of 
edges. 

3 Bipartite Contraction and the cost of 2-colorings 

In the Bipartite Contraction problem we are given a graph G and an integer 
k, and the task is to determine whether there exists a set S C E{G) of at most k 
edges such that G/ S is bipartite. The following lemma allows us to reformulate 
this problem in terms of 2-colorings of the graph G. 

Lemma 1. A graph G has a 2-coloring (j) of cost at most k if and only if there 
exists a set S C E{G) of at most k edges such that G/S is bipartite. 
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Proof. Suppose G has a 2-coloring (j> of cost at most k. We build an edge set S 
as follows. For each monochromatic component X G A4^, find a spanning tree 
T of G[X] and add the |X| — 1 edges of T to S. The total number of edges in 
S is exactly the cost of 0, so \S\ < k. It remains to argue that G" = G/S is 
bipartite. Let £ 5 be a bad edge in G, and let be the 2-coloring of G/uv 
that assigns color (j){u) = to the new vertex resulting from the contraction 
of uv, and that assigns color 4i{w) to every vertex w £ V{G) \ {w, v}. Since we 
contracted a bad edge, the cost of (f>' is 1 less than the cost of 0. Repeating this 
for every edge in S yields a 2-coloring 0" of G" of cost 0. This means that 0" is 
a proper 2-coloring of G', which implies that G' is bipartite. 

For the reverse direction, suppose there is a set S C E{G) of at most k edges 
such that G' = G/S is bipartite. We define G* to be the graph with the same 
vertex set as G and edge set S, i.e., G* = {V{G),S). Let W be a G'-witness 
structure of G. Observe that W{y) induces a connected component of G* for 
every y £ F(G'). Let be a proper 2-coloring of G'. We construct a 2-coloring 
0' of G as follows. For every v £ V{G), we set 4>'{v) = 0(2/), where y is the 
vertex in V{G') such that u G Since the monochromatic components of 

0' are exactly the connected components of the graph G* , and since G* contains 
exactly S edges, the cost of 0' is at most |5| < fc. □ 

An instance of the Cheap Coloring problem consists of a graph G and an 
integer fc, and the task is to decide whether G has a 2-coloring of cost at most 
k. Lemma 1 shows that the problems Bipartite Contraction and Cheap 
Coloring are equivalent. 

The deletion of an edge can not increase the cost of a 2-coloring, and can only 
decrease the cost of a 2-coloring by at most one. We state this as the following 
observation. 

Observation 1 Let (p be a 2-coloring of G of cost k. For any edge uv G E{G), 
the cost of (f> in G — uv is k or k — 1. 

Observation 1 allows us to use the well-known iterative compression technique 
of Reed, Smith and Vetta [20] to reduce the Cheap Coloring problem to the 
Cheaper Coloring problem. The Cheaper Coloring problem takes as input 
a graph G, an integer k, and a 2-coloring of G of cost fc -|- 1, and the task is 
to either find a 2-coloring of G of cost at most fc, or to conclude that such a 
coloring docs not exist. 

Lemma 2. // there is an algorithm for Cheaper Coloring that runs in time 
f{k) , then there is an algorithm for Cheap Coloring that runs in time 
f{k) n'^m. 

Proof. Suppose there exists an algorithm for Cheaper Coloring that runs 
in time /(fc) n^. Then we can solve an instance (G, fc) of Cheap Coloring by 
iterating over the edges ei , 62, . . . of G as follows. For every i G {1, . . . , m}, we 
define Gi to be the graph with vertex set V{G) and edge set Ei = {ej : j <i}. 
The graph Gi has a 2-coloring 0i of cost 0, which is at most fc. For the first fc 
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iterations, we trivially maintain a 2-coloring of cost at most k. Now, in iteration 
i of the algorithm, assume that we have a 2-coloring of cost at most k in G^. 
By Observation 1, the cost of (pi in G^+i is at most k + 1. If the cost of in 
Gi+i is at most fc, then we proceed to the {i + l)th iteration. Otherwise, we run 
the algorithm for Cheaper Coloring with input (G,+i, fc, (pi). If the algorithm 
concludes that G,;+i has no 2-coloring of cost at most k, then, by Observation 1, 
neither docs G. If, on the other hand, the algorithm outputs a 2-coloring (pt^i 
of Gi+i of cost at most k, then we proceed to the (i + l)th iteration. Since we 
call the algorithm for Cheaper Coloring at most m times, each time with 
parameter fc, the time bound follows. □ 

We have now almost reached the variant of the problem that will be the focus 
of attention in the remainder of this paper. For two disjoint vertex sets Ti and 
T2, a 2-coloring of G is a (Ti, T2)- extension if (p colors every vertex in Ti with 1 
and every vertex in T2 with 2. In the Cheap Coloring Extension problem we 
are given a bipartite graph G, two integers k and t, and two disjoint vertex sets 
Ti and T2 such that |Ti| -|- IT2I < t. The objective is to find a (Ti, -extension 
(j) of cost at most k, or to conclude that such a 2-coloring docs not exist. We 
will say that a (Ti, T2)-extension (/> is a cheapest (Ti, T2)-cxtcnsion if there is no 
(Ti, T2)-cxtension p' with strictly lower cost than (p. 

Lemma 3. If there is an algorithm for Cheap Coloring Extension that runs 
in time f{k,t) n'^ , then there is an algorithm for Cheaper Coloring that runs 
in time 4'=+i/(fc, 2k + 2) n" . 

Proof. Given an f{k,t)n'^ time algorithm for Cheap Coloring Extension, 
we show how to solve an instance (G, fc, cp) of Cheaper Coloring. Let S be 
the set of all bad edges in G with respect to (p, and let X be the set of endpoints 
of the edges in S. Since (p has cost fc -I- 1, we have \X\ <2k + 2. We create 4*^+^ 
instances of Cheap Coloring Extension as follows. 

For every possible partition X into two sets Xi and X2, we set k' = k and 
t = |^i| + 1^2!, and we build a graph G(Xi, X2) from G in the following way. As 
long as there is an edge ilv e S such that u and v are both in Xi or both in X2, 
contract the edge uv, put the new vertex resulting from the contraction into the 
set Xi that u and v belonged to, and decrease fc' by 1. Since the cost of (p is at 
most fc -I- 1 , we contract at most fc -I- 1 edges in this way, and hence fc' > — 1 . When 
there are no such edges left, then we discard this partition of X into Xi and X2 
if fc' = —1; otherwise, we continue to build an instance of Cheap Coloring 
Extension as follows. Delete all edges uv e S with u £ Xi and v e Xj such 
that i ^ j. Since S contains all the edges of G that are bad with respect to </>, 
and each of the edges of 5* is either contracted or deleted, the resulting graph 
G(Xi, X2) has no bad edges with respect to (p and is therefore bipartite. Thus we 
obtain an instance (G(Xi, X2), fc', Xi, X2) of Cheap Coloring Extension 
with fc' > 0. 

We now show that (G, fc, (p) is a yes-instance of Cheaper Coloring if and 
only if there is a partition oiX into Xi and X2 such that {G{Xi, X2), fc', t, Xi, X2) 
with fc' > is a yes-instance of Cheap Coloring Extension. 
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Suppose that (G, k, (p) is a yes-instance of Cheaper Coloring. Then there 
exists a 2-coloring 0* of G of cost at most k. Let Xi and X2 be the vertices of X 
that are colored 1 and 2 by 0* , respectively. Consider the set 5' C 5 of edges that 
were contracted in order to obtain G{Xi, X2) from G in the way described earlier. 
Since every edge in S" is bad with respect to (/)*, the cost of (/)* decreased by 1 
with every edge contraction. Hence, (j)* is an (Xi, X2)-extcnsion of G{Xi,X2) of 
cost k' . We conclude that (G(Xi, X2), fc', i, Xi, X2) is a yes-instance of Cheap 
Coloring Extension. 

For the reverse direction, suppose there is a partition of X into Xi and 
X2 such that (G(Xi, X2), fc', i, Xi, X2) is a yes-instance of Cheap Coloring 
Extension with k' > 0, i.e., the bipartite graph G{Xi,X2) has an {Xi,X2)- 
extension ip of cost at most k'. Let S" C 5' be the set of edges that were contracted 
in G to create the instance {G{Xi, X2), k' , t, Xi, X2). Since k' = k — \S'\ > 0, we 
have that |S"| < k. We define a 2-coloring 6* of G by coloring both cndpoints of 
every edge uv in S' with the color that assigned to the vertex resulting from 
the contraction of the edge uv, and coloring all other vertices in G with the color 
they received from ip. Clearly, the cost of is at most fc' + |5"| = fc, and therefore 
(G, fc, (p) is a yes-instance of Cheaper Coloring. 

Since we need to run the f{k,t)n'^ time algorithm for Cheap Coloring 
Extension at most 4*^+^ times, with parameters k' < k and t ^ \X\ < 2k + 2 
at each iteration, the time bound follows. □ 

The next section is devoted to showing that Cheap Coloring Extension 
is fixed parameter tractable when parameterized by k and t. The reason wc want 
to work with the Cheap Coloring Extension problem rather than with the 
Bipartite Contraction problem directly is that, as we shall see in Section 4.2, 
Cheap Coloring Extension is a "cut" problem, and is therefore amenable 
to techniques based on important separators [16]. 

4 Solving Cheap Coloring Extension in FPT time 

In this section, we present an algorithm for the Cheap Coloring Extension 
problem. For the remainder of this section, let (G, fc, t, Ti, T2) be a given instance 
of Cheap Coloring Extension. Recall that G is bipartite. The high level 
structure of our algorithm is as follows. If the treewidth of G is bounded by a 
function of fc and i, then we can use standard dynamic programming techniques 
to solve the problem in time f{k,t) n. If, on the other hand, the treewidth of G 
is large, then we can find a large set which is "highly connected". In this case 
we show how to find in f{k,t)n^^^^ time an edge e G E{G) such that G has a 
(Ti, T2)-extension of cost at most fc if and only if G — e does. We then re-run 
our algorithm on G — e. 

To make the distinction between the two cases in our algorithm more precise, 
we use the following notion, due to Diestel et al. [5]. A set X C V{G) is p- 
connected'm G if \X\ > p and, for all subsets Xi,X2 C X with \Xi\ = \X2\ < p, 
there are \Xi\ vertex-disjoint paths in G with one endpoint in Xi and the other 
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in X2. Diestel et al. [5] prove the following statement in the proof of Proposition 
3 (ii): a h > p and G contains no p-connected set of size h, then G has treewidth 
< h + p — 1. (In fact, they prove a stronger version of this statement using 
the notion of an externally p-connected set, but we do not need this stronger 
assertion for our purposes.) We define a set X to be well-connected if it is |X|/2- 
connectcd. Using this definition, the result of Diestel et al. can be seen to imply 
the following. 

Theorem 1 ([5]). If tw{G) > w, then G contains a well-connected set of size 
at least 2w/3. 

The proof of Theorem 1 is constructive. In fact, given G and if, a tree de- 
composition of width at most w or a well-connected set of size at least 2w/3 
can be computed in time c™n'-'(^) for some constant c [5]. We use Theorem 1 to 
compute either a tree-decomposition of G of width at most 3{4k^) 1 4'*'^ -I- 3 or a 
well-connected set Y of size at least 2(4fc^) i 4'*'^" -f- 2. Section 4.1 deals with the 
first case, whereas the second case is covered in Section 4.2. 

4.1 Small treewidth 

Suppose our algorithm has found a tree-decomposition of G of width at most 
3(4fc^) i 4'*'' -f 3. We will use the following celebrated theorem by Courcelle [1] 
to solve the Cheap Coloring Extension problem in this case. 

Theorem 2 ([1]). There is an algorithm that tests whether an MS O formula ijj 
holds on a graph G of treewidth w, in time f{\'ip\, w) n. 

We remark that Theorem 2 holds even when the input graph G is supple- 
mented by unary relations ai , . . . , on vertices and edges and the MSO formula 
ip is allowed to use these relations [1] . 

Lemma 4. There is an algorithm that, given an instance (G, k,t,Ti,T2) of 
Cheap Coloring Extension together with a tree- decomposition of G of width 
w, solves the instance in time f{k,t,w)n. 

Proof The Cheap Coloring Extension problem can be formulated in MSO. 
To see this, we define two unary relations on the vertex set of G: ai{v) is true 
if V e Ti and a2{v) is true if w G T2. We claim that the following formula holds 
on G if and only if G has a (Ti, r2)-extension of cost at most k. 

^ = 3SiC 1/(G), ^2 C 1/(G), C C E{G),ei G S(G), 62 G E{G), ...,6^6 E^G) : 
C = {ei,e2,...e/c} AVw G V{G),ai{v) v e Si A a2(w) v e S2 
A yuv G E{G) {{u e Si A V e S2) V {u e S2 A V e Si) 
V (VXi C V{G),u ^XiyveXiV3xyeG{xeXiAy(^ Xi))) 

The interpretation of ip is as follows. The sets Si and S2 are the vertices of G 
colored 1 and 2, respectively, by a (Ti, T2)-extension cj). The edge set C contains 
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all the edges, k in total, of a spanning tree of each monochromatic component of 
cf>. Every edge uv of G is either good (third line of the formula), or bad (fourth 
line). If uv is a bad edge and uv belongs to C, then we can take xy = uv. Suppose 
uv is a bad edge and uv ^ C. Let X be the monochromatic component of (j) such 
that G[X] contains uv, and let Xi be any subset of V{G) containing u but not 
V. Since G contains all the edges of a spanning tree of G'[X], there exists a path 
P in G[X] from u to v, using only edges of G. Hence we can take xy to be the 
first edge of P that has one endpoint (x) in Xi and the other endpoint (y) in 
V{G)\X,. 

Note that, for simplicity, we took some liberties in the formulation of the MSO 
formula. For example, C = {ei, 62, ... 6^} is not really an MSO formulation, but 
it can easily be translated into MSO by demanding that every edge 6 C and 
that any edge in G must be one out of ei, 62, . . . e^-. Similarly, operators such as 
— > can be reformulated using the A, V, and -1 operators. Applying Theorem 2 
to this formulation completes the proof of the lemma. □ 

We would like to remark that, given an instance {G,k,t,Ti,T2) of Cheap 
Coloring Extension together with a tree-decomposition of G of width w, it is 
possible to solve that instance in time (t(;-|-l)*^*-'"^n, using standard dynamic pro- 
gramming techniques. This gives a much faster algorithm than the one obtained 
by applying Theorem 2 on the MSO formula. However, it would take several 
pages to give the details of such an algorithm, and for the main purpose of this 
paper, we find it sufficient to handle the case of small treewidth by Lemma 4. 

4.2 Large treewidth and irrelevant edges 

Suppose our algorithm did not find a tree-decomposition of G of small width, 
but instead found a well-connected set Y of size atleast2(4fc2)t4'^'= -1-2. We use 
Y throughout this section to refer to this specific set. An edge e G E{G) is said 
to be irrelevant if it satisfies the following property: G has a (Ti, r2) -extension 
of cost at most k if and only if G — e docs. We will show that the presence of 
the large well-connected set Y guarantees the presence of an irrelevant edge e in 
G. Hence we find such an irrelevant edge e in G, delete it from the graph, and 
solve Cheap Coloring Extension on the instance (G — e, fc, i, Ti, T'2). Since 
each iteration of this process deletes an edge, we will find a tree-decomposition 
of the graph under consideration of small width after at most m iterations, in 
which case we solve the problem as described in Section 4.1. 

Observation 3 below gives a hint about how we are going to identify an 
irrelevant edge of G. We first need a basic observation about bad edges. 

Observation 2 Let (j) be a 2-coloring of G. No bad edge has both endpoints in 
the same good component of (f). 

Proof. Suppose, for contradiction, that G has a bad edge uv such that both u 
and V belong to a good component G of </>. Since uv is bad, we have 4i{u) = (f>{v). 
Every good component is connected, so there is a path P in G, starting in u and 



9 



ending in v, consisting only of good edges. The path P must contain an even 
number of edges, implying that P and uv together form an odd cycle in G. This 
contradicts the assumption that G, which is part of the instance (G, k, t, Ti, T2) 
of Cheap Coloring Extension that we are solving, is bipartite. □ 

Observation 3 Let uv G E{G). If (j) is a cheapest (Ti,T2)- extension of G — uv 
and u and v are in the same good component of 4>, then uv is irrelevant. 

Proof. Suppose u and v belong to the same good component of a cheapest 
(Ti, r2)-extension (j) oi G — uv. Note that </> is a 2-coloring of G, and that the 
edge uv in G is good with respect to as a result of Observation 2. Hence (f) is 
a (Ti, r2)-extension of G, and the cost of in G equals the cost of in G — uv. 
As a result of Observation 1, must be cheapest (Ti, T2)-extension of G. Since 
the cost of a cheapest (Ti, T2)-extension G — uv equals the cost of a cheapest 
(Ti, r2)-extension of G, the edge uv is irrelevant by definition. □ 

In order to use Observation 3, we need to identify vertices that will end up 
in the same good component of some cheapest (Ti, T2)-extension of G — uv. 
The vertices in Y are good candidates, because they are so highly connected to 
each other. Over the next few lemmas we formalize this intuition. We start with 
two observations that will allow us, in the proof of Lemma 5 below, to bound 
the number of bad edges and the number of good components of a cheapest 
(Ti, T2)-extension of G of cost at most k. 

Observation 4 Let (j) he a 2-coloring of G. If (j) has cost at most k, then there 
are less than 2k^ bad edges. 

Proof. Let M'^ = {X g : |X| > 2} be the set of monochromatic compo- 

nents of (j) containing more than one vertex, and let G' be the disjoint union of 
the graphs induced in G by the elements of AiL, i.e., G' = [JxeM' C![X]. By 
definition, the cost of is J2x&M4,i\-^\^ ^ J2xeM'^(\-^\~ which is exactly 
the number of edges in any spanning forest of G'. Since any forest on at most k 
edges without isolated vertices has at most 2k vertices, we have |1/(G')| < 2k. 
Every bad edge has both endpoints in V{G'), so the number of bad edges is at 
most (^2'') < 2fc2. □ 

Observation 5 Let (j) he a cheapest {Ti,T2)- extension of G. Every good compo- 
nent of (j) contains a vertex from Ti U T2 . 

Proof. Suppose a good component G of </> does not contain any vertex from 
Ti U r2 . We build a coloring (p' from </> by changing the color of every vertex in 
G, leaving the color of every other vertex unchanged, i.e., = 3 — if 

V G C, and 4>'{v) = (j){v) iiv^G. Since (j){v) = 4>'{v) for every w e Ti U T'2, 0' is 
a (Ti, r2)-extension of G. Furthermore, every edge that was good with respect 
to (j) is good with respect to 4>' , while every edge in 6g{C) was bad with respect 
to (j) and is good with respect to (j)' . Since G is connected, there is some vertex 

V G G which is incident to at least one edge that was bad with respect to cf). On 
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the other hand, all edges incident to v are good with respect to (j)' . Hence {w} is a 
monochromatic component of 0', but {w} was not a monochromatic component 
of 0. This means that jA^^I < This, together with the observation that 

the number of edges that are bad with respect to 0' is not more than the number 
of edges that were bad with respect to 0, implies that the cost of 0' is strictly 
less than the cost of 0. This contradicts the assumption that (/> is a cheapest 
(Ti , T2)-extension of G. □ 

The next lemma shows that almost all the vertices of Y appear in the same 
good component of any cheapest (Ti, T2)-extension of G of cost at most k. 

Lemma 5. Let uv G E{G), and let (f> he a cheapest {Ti,T2)- extension ofG — uv 
of cost at most k. There exists exactly one good component G* of (j) satisfying 
\Y\C*\ < 2fc^, and every other good component C' of (f> satisfies |y nC"| < 2fc^. 

Proof. Let C be a good component of 0. We first show that either |1^\C| < 2fc^ or 
|y n C| < 2fc2. Suppose for contradiction that |y n C| > 2fc2 and \Y \G\ > 2k'^. 
We define Yi to be the smallest of the two sets Y n C and Y \G, and Yz to 
be any subset of the largest of the two sets such that II2I = l^il- Note that 
2fc^ + 1 < lYil = |i2| < |y|/2. By the definition of a well-connected set, there 
are |Yi| > 2k^ + l vertex-disjoint paths with one endpoint in FnC and the other 
in F \ C. At least 2fc^ of these paths exist in G — uv, and each of those must 
contain an edge in Sg-uv{C)- Since each edge in Sg-uv{C) is bad, it follows that 
cj} has at least 2k^ bad edges, contradicting Observation 4. 

Now suppose for contradiction that (f) does not have a good component C* 
with |y \G*| < 2fc^. Then \Yr]C\ < 2fc^ for every good component G of </>, as we 
showed earlier. Since </> has at most t = |Ti| -I- IT2I good components as a result 
of Observation 5, at most 1 2k^ vertices of Y appear in good components. The 
fact that the size of Y is much larger than t2k^, together with the observation 
that every vertex of G appears in a good component by definition, yields the 
desired contradiction. Hence we know that (j) has a good component G* with 
|y \ G*| < 2fc^. The uniqueness of G* follows from the sizes of Y and G*, and 
the fact that the good components of (f> are pairwise disjoint. □ 

There are two problems with how to exploit the knowledge obtained from 
Lemma 5. The first is that, even though we know that almost all the vertices 
of Y appear in the same good component together, we do not know exactly 
which ones do. The second problem is that we are looking for an edge with both 
endpoints in the same good component, and Y could be an independent set and 
thus not immediately give us an edge to delete. We deal with both problems by 
employing the very useful notion of important sets. For two vertices x,y G V{G), 
we say that a set X C V{G) is (x, y)-important if it satisfies the following three 
properties: (1) x £ X and y ^ X; (2) G[X] is connected; and (3) there is no 
X' D X, y ^ X' such that dciX') < daiX) and G[X'] is connected. The 
following theorem was first proved in [2]. We use here the formulation in [15], 
because that one best fits the purposes of this paper. 
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Theorem 3 ([2, 15]). Let x,y be two vertices in a graph G. For every p > 0, 
there are at most 4P [x^y) -important sets X such that dciX) < p. Furthermore, 
these important sets can be enumerated in time 4P ■ n'~'^^\ 

Suppose G — uv has a cheapest (Ti, T2)-extension (f) of cost at most k. We will 
use the important sets together with Lemma 5 to identify vertices in Y which 
must be in the unique good component C* of </> that contains all but at most 
2fc^ vertices of Y. We first build a graph G* from G by adding a new vertex y* 
and making y* adjacent to all vertices in Y. We then enumerate all x S Ti U T2 
and all (x, 2/*)-important sets X in G* such that dciX) < Ak^ . By Theorem 3, 
this can be done in time 4^*^ rf^^^ for each choice of x. Finally, we define the set 
Z to be the union of all enumerated sets X. In other words, 

Z ^{we V{G*) : 

3x G Ti U T2, X C V{G*), w ex, da- {X) < Ak^ and X is {x, y*)-important} 

Observe that, given G and Y , Z can be computed in time 1 4"""' rfi^^^ . We will use 
the set Z in the following way. First we show that if there is an edge uv S E{G) 
such that neither u nor v are in Z , then the edge uv is irrelevant. Then we show 
that such an edge always exists. 

Lemma 6. Let uv G E{G) such that u ^ Z and v ^ Z . Then uv is irrelevant. 

Proof. Let be a cheapest (Ti, T2) -extension oi G — uv with cost at most k. Let 
G* be a good component of (p such that [F \ C*| < 2k^ . By Lemma 5, such a 
component C* exists, and every other good component C oi (p satisfies |y nC| < 
2fc^. We prove that both u and v are in C* . Suppose u ^ G* . Then m G C for some 
other good component of 4>. Now, C induces a connected subgraph in G — uv, 
and all edges leaving C in G — uv are bad with respect to (p. Since (p has less 
than 2fc^ bad edges by Observation 4, it follows that dG-uv{G) < 2k'^, and thus 
daiC) < 2fc2. Furthermore, because |y n G| < 2k'^, we have that dG'{G) < Ak'^. 
Finally, by Observation 5, G must contain a vertex x G Ti U 72- Hence there 
must be a (cc, y*)-important set X such that G C X and d{X) < Ak^ in G*. 
But G C X C Z, which implies that u £ Z, contradicting the assumption that 
u ^ Z. The proof that u G G* is identical. We conclude that both u and v are 
in G*, and hence, by Observation 3, uv is irrelevant. □ 

Lemma 7. G contains an edge uv such that u ^ Z and v ^ Z . 

Proof. We first prove that daiZ) < {Ak^)tA'^^^ and \Z r\Y\ < {Ak^)tA^''\ For 
the first inequality, it suSices to show that dciZ) < {Ak'^)tA'^'' , because G is 
a subgraph of G* . However, Z is the union of at most tA'^'' important sets X, 
where dc {X) < Ak"^ for each set. Hence the first inequality follows. To see that 
\Z r\Y\ < {Ak'^) t A'^'^^ , observe that each (x, i/*)-important set X in G* with 
do'iX) < Ak"^ contains at most 4fc^ vertices of Y , since each vertex in F is a 
neighbour of y* . 
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In order to prove that G contains an edge uv with u ^ Z and w ^ Z, we 
arbitrarily partition Y into Yi and Y2 such that |Yi| = \Y2 \ and Zr\Y C Y2. Since 
|y| > 2(4fc2)t44'^' +2 by assumption and we showed that \Zf^Y\ < (4fc2)^4'"=^ 
such a partition always exists. By the definition of a weh-connected set, there 
are jYil = (4fc^)i4""° + 1 vertex-disjoint paths starting in Yi and ending Y2. 
For every i < \Yi\, let UiVi be the first edge on the ith such path, with m £ Yi. 
Recall that Z n Y C Y2 by assumption. Since all of the u^'s are in Yi, none of 
them arc in Z. Thus, each Vi that belongs to Z contributes one to dciZ), as 
then UiVi e Sc{Z). Since we bounded daiZ) from above by (4fc^)t4'''^ at the 
start of this proof, not every Vi can belong to Z . Hence there is an edge UiVi 
with neither endpoint in Z. □ 

We are now ready to state the main lemma of this section. 

Lemma 8. Cheap Coloring Extension can he solved in time J{k,t)n^^^\ 

Proof. Let (G, fc, t, Ti, T2) be an instance of Cheap Coloring Extension, and 
let / be an appropriate function that does not depend on n. We first apply Theo- 
rem 1 and the remark immediately following it to compute, in time /(fc, t) n'-''^^\ 
either a tree-decomposition of G of width at most 3(4fc^)t4""^' -1- 3 or a well- 
connected set Y of size at least 2(4fc^) 1 4"""' -I- 2. If we get a tree-decomposition 
of small width, we apply Lemma 4 to solve the problem in additional time 
f{k,t)n, after which we terminate and output the answer. If we find a well- 
connected set Y, we continue to find an irrelevant edge e £ E{G) and delete it 
from G. Lemmas 6 and 7 guarantee that such an edge always exists. In order 
to find e, we first compute the set Z. We already argued that this can be done 
m time f{k,t)n'^^^l We can find an irrelevant edge as explained in the proof 
of Lemma 7 in additional polynomial time, since this amounts to computing 
Z n Y, choosing Y2 to contain the whole intersection and as many more ver- 
tices as needed to obtain |Yi| = IY2I, and checking all edges leaving Yi to find 
one whose endpoints do not belong to Z. The total running time of this whole 
procedure is clearly f{k,t) n'~'^^\ 

After an irrelevant edge e is deleted from G, we run the whole procedure 
on (G — e,k,t,Ti,T2). This can be repeated at most \E{G)\ = n*^^^^ times, and 
hence the total running time f {k , t) n'-"-^^ follows. □ 

Our main result immediately follows from Lemmas 1, 2, 3, and 8. 

Theorem 4. Bipartite Contraction is fixed parameter tractable when pa- 
rameterized by k. 

We end this section with a remark on the running time. If we use Lemma 4 
in the proof of Lemma 8, then the parameter dependence of the whole algorithm 
is dominated by a very large function in k [1]. However, as we remarked after 

Lemma 4, we can obtam a runnmg time ot (4^'' >) n = 2 n tor the 
small treewidth case of the proof of Lemma 8. This is because the treewidth 
of the instance at hand is 44'='fcO(i) = 44fe"o(iogfc) ^ ^0{k^) ^^g^ ^^^^i 
treewidth case applies. This gives a total running time of of 22°' 'nO(i) for our 
algorithm for Bipartite Contraction. 
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5 Concluding remarks 



For completeness, we would like to mention that Bipartite Contraction is 
NP-complete; a polynomial time reduction from Edge Bipartization can be 
obtained by replacing every edge of the input graph by a path of sufficiently large 
odd length. A highly relevant question is whether Bipartite Contraction 
admits a polynomial kernel, meaning that there is a polynomial time algorithm 
that transforms an instance {G,k) to an instance {G',k') of size g{k), where 
g is a polynomial in k. The mentioned NP-complctcncss reduction from Edge 
Bipartization is parameter preserving, as k remains the same. However, it is 
not known whether or not Edge Bipartization admits a polynomial kernel. 
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